Reliable First-Principles Alloy Thermodynamics via Truncated Cluster Expansions 
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In alloys cluster expansions (CE) are increasingly used to combine first-principles electronic- 
structure calculations and Monte Carlo methods to predict thermodynamic properties. As a basis-set 
expansion in terms of lattice geometrical clusters and effective cluster interactions, the CE is exact if 
infinite, but is tractable only if truncated. Yet until now a truncation procedure was not well-defined 
and did not guarantee a reliable truncated CE. We present an optimal truncation procedure for CE 
basis sets that provides reliable thermodynamics. We then exemplify its importance in NiaV, where 
the CE has failed unpredictably, and now show agreement to a range of measured values, predict 
new low-energy structures, and explain the cause of previous failures. 
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PACS numbers: 05.10.-a,64.70.Kb,81.30.Bx 

The cluster expansion (CE) is increasingly used as a 
valuable tool for predicting and interpreting thermody- 
namic effects in a wide class of materials and problems, 
including precipitation |^ |3|, i^], solubility limits j5j|, 
ionic diffusion [6j, surface alloying |3 and patterning 
vacancy 9] and chemical 0,^3 ordering. As a means for 
multiscaling based on density-functional theory (DFT) 
electronic-structure energetics, the CE is a basis-set ex- 
pansion in rt-body clusters (associated with n Bravais lat- 
tice points) and effective cluster interactions (ECI) that 
specify configurational energies. Except for implicit DFT 
errors in the energy database, the CE is exact for an infi- 
nite basis, but impractical if not vastly truncated 
Although there are many successes, a truncated CE can 
and has unpredictably failed. 

We present a new method for an optimal truncation of 
the basis set that gives reliable thermodynamics. We 
then detail its importance in face-centered-cubic (fee) 
NiaV, a system with order-disorder transition from dis- 
ordered A\ phase to ordered DO 22 phase at Tc of 1318 K 
[13. Previous CE for NigV 0, 111 El had errors of 
40 — 1000% for a range of thermodynamic properties, 
prompting a search for missing physics |l6|. We show 
that our new method allows more reliable predictions, 
including that of key low-energy configurational excita- 
tions. As a synopsis, we compare in Table m our CE 
results, along with the previous ones, with experimen- 
tal values of and AE^]^^"'^^^ , the energy difference 
between DO22 and metastable LI2 as assessed from the 
short-range order (SRO) measurements J^,?, 18J. The new 
CE now agrees with a range of experimentally assessed 
values (more below). We find that prior failure in NiaV 
is due to inappropriate truncation of the cluster basis 
set and overfitting to get the ECI - underscoring again 
the need for careful application of basis-set methods. We 
have tested this new CE method on a few cubic and non- 
cubic binaries and ternaries and found it to be especially 
important when multibody ECI are significant. 



Cluster Expansions: Any alloy configuration may be 
represented by a set of occupation variables {^^ }, with 

— 1(0) if the site p is (is not) occupied by an a-atom. 
Composition c" is the thermal- and site-average of {^p } 
with < c" < 1. The energy of any atomic configuration 
a can be written in a CE TT] using the n-body ECI W""* : 



Ece{<j) = V'-°^ + 



nj.d 



(1) 



Sums are over symmetry-distinct (n, /) and equivalent 
{d — 1,...,£)^"', the degeneracy) clusters. A CE basis 
can be also presented as a product of orthonormal Cheby- 
chev polynomials based on ^[ . The n-site correlation 

function ^^J^J = (^p'^p" . . . CpC^)) is given by an ensemble 

average over the fixed sets {pYpJ defining the n-body 
clusters of type (/, d), see Fig. ^ When evaluated above 
Tc, for example, are related to the SRO. If the ECI 
are known, then the energy of any configuration can be 
predicted. 

A CE can be truncated if there is rapid convergence 
of the ECI Fj"-* with increasing distance r (e.g., as 
measured by cluster radius of gyration or circumscribed 
sphere) and with increasing number of sites n in a clus- 
ter /, i.e. smaller n-body clusters are more physically 
important. Also vj"'' for n>no uncorrelated sites have 

their contributions to suppressed by ^^"^~c", i.e. 
]/(")$(") ^0, and can be neglected. The magnitudes 



TABLE I: New truncated CE (CE2 and CE3) and ex- 
perimental .I7J values of Tc (Kelvin) and the AE'^^^q^'^^^ 
(meV/atom) assessed from SRO, along with the former CE 
nil and CPA [13 results. Details in text. 





CE2 


CE3 


Expt. 


old CE 


CPA 


Tc (K) 


1335 


1370 


1318 


1900 




^^SRO 


22±16 


17±15 


12±5 


101 


7-12 



of y^"-' typically become smaller for larger n, although 
for some systems ECI convergence is not rapid: such is 
Li2.Ni02 where Jahn- Teller distortions control Li- vacancy 
ordering and ionic conduction and are reflected only in 
long-range multibody ECI 20]. For a truncated CE, 
ECI are obtained via structural inversion |2lL |23 | at 
fixed c for c-dependent (canonical) ECI or versus c for 
c-independent (grand-canonical) ECI; these sets of ECI 
are related [ISISl- First, a set of N fully-ordered (few 
atoms per cell) structures is somehow chosen and their 
DFT energies E^jpj, [i — 1..N) are calculated. Then, 
a set of M clusters {M<N) is somehow picked for use 
in and $ are calculated for each structure. A sys- 
tem of TV linear equations with M unknown ECI is 
solved by least-squares (LS) fitting - which unavoidably 
includes DFT errors in energy differences. As is obvious, 
the sets of structures and clusters used to get the ECI 
are not uniquely defined. 

New Method: Here we propose a method that, given a 
set of structural energies, unambiguously defines a set of 
clusters (and ECI) to provide an optimal truncated CE 
and yield reliable thermodynamics. First, we note that 
if T/(")(r>ro"^) = 0, the truncated CE basis with local 
compact support that includes all clusters in r^"^ is lo- 
cally complete and exact; whereas, if F(")(r>r|,"^) « 0, 
this truncated CE is approximate and has an error. With 
no a priori knowledge of which clusters are required to 
represent well a given alloy, the CE error is minimal, 
in a Rayleigh-Ritz variational sense, if all admissible n- 
body clusters (basis functions) of smaller spatial extent 
(rfCrp"'') are included before the larger ones. In brief, 
having N E\jprp to be fitted, we can establish a varia- 
tional CE basis by simple rules ad vitam aut culpam that 
implement easily computationally: 

1. If an n-body cluster is included, then include all 
n-body clusters of smaller spatial extent. 

2. If a cluster is included, include all its subclusters. 

3. To prev ent both under- and over-fittin g, m inimize 
[islli^ the cross-validation (CV) error |27ll28t: 

1 ^ 

cy'-j^Ts^FT~E^E'f- (2) 

i=l 

E^^p, in 101 is predicted by a fit to N—1 DFT energies 
excluding E\yprp^ rather than to all TV as in a LS fit. 
(This is an "exclude 1" CV, whereas an "exclude 0" CV 
is a LS fit.) While LS measures the error in reproducing 
known values of Ejjprp^ CV error estimates an uncer- 
tainty of predicted values. Both too few (underfitting) or 
too many (overfitting) parameters give poor prediction. 
The new Rule 1, with well-established Rule 2, now makes 
it easy to define uniquely all clusters in a truncated, vari- 
ational CE basis by the number of n-body clusters (or the 
size of the largest n-body cluster) for each n < no. In 




FIG. 1: Tlie 2- to 6-body clusters for 1"* and 2"'* fee 
neighbors. Ineluding clusters up to 4-body in upper set 
(6-body in both sets) form a locally complete basis in the 
range of 1^' (2"'') neighbors. These clusters form tetrahedron 
(tetrahedron-octahedron) approximations used in cluster- 
variation method JJJ. 

particular. Rules 1 and 2 permit a hierarchy of ranges 
for n-body clusters, i.e., rl^^>rQ^^^^ for all n, giving a 
locally complete basis for strict equalities, while the in- 
equahty (e.g., more 2-bodies, fewer 3-bodies, even fewer 
4-bodies, etc.) allows for fewer clusters. Because shorter- 
ranged and lower-order ECI are more important, Rule 
1 [Rule 2] prohibits excluding more important ECI and 
transferring their weight to less important longer-ranged 
[higher-order] ones. 

Once constructed, an optimal CE can be used to pre- 
dict energy of any structure within the CV error. The 
CE is valid if the lowest structural energies (including 
the ground-state) and fully-disordered state energies are 
correct within the accuracy given by the CV error. A 
valid optimal CE provides reliable thermodynamics. 

Application: We now construct and assess the new 
canonical CE for NiaV based on 45 fully-relaxed 
structural DFT energies, with relative accuracies of 
~lmeV/atom 35]. A selected set of energies is given 
in Table in 

To examine effects of the new truncation method on 
prediction, we first limit the CE basis to pairs only and 
find that the pairs-only CV is minimal for 2 (nearest 
and next-nearest) pair interactions, see Fig. 13 Within 
this range, the symmetry-distinct clusters are two pairs, 



TABLE II: Selected NisV energies (meV/atom) from DFT 
and CE relative to along with those of disordered 

state, its approximation by SQSie 29], and (001) APB. 



Structure 


Edft 


EcE2 


SEcE2 


EcEs 


SEcEa 


DO22 (1|0) 

DO23 (i|o) 


0.0 


7.1 


7.1 


10.5 


10.5 


25.4 


31.6 


6.2 


35.8 


10.4 


33.7 


31.6 


-2.1 


35.8 


2.1 


LI2 (100) 


101.2 


56.1 


-45.1 


61.1 


-40.1 




147.8 


170.6 


22.8 


154.7 


6.9 


SQS16 


155.4 


135.8 


-19.7 


152.0 


-3.5 


Disordered 


n/a 


109.8 




115.1 




{001)APB 


101.6 


98.0 


-3.6 


101.1 


-0.5 


LI2 - -DO22 


101.2 


49.0 


-52.2 


50.5 


-50.7 


DO23 - -DO22 


25.4 


24.5 


-0.9 


25.3 


-0.1 
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FIG. 2: (upper) The predicted and observed (dotted line) 
Tc (K), and (lower) the CV and LS errors (meV/atom) vs. 
ECI truncation. Results are for CE with pairs-only (left) and 
for 2 and 3 pairs with a number of 3-body ECI (right). The 
minimal CV and LS are given by filled symbols. 



two 3-bodies, three 4-bodies, a 5-body pyramid and a 
6-body octahedron, see Fig. The CE with two pair 
and two triplet interactions (denoted CE2) with minimal 
CV of 15.5 meV within this range gives Tc at 1335 K, 
near the observed 1318 K. Including tetrahedron does 
not significantly alter Tc, as expected for a 4-body clus- 
ter at c = 1/4, as c^<^l and T/W$('*)^0. For optimal 
truncation, including the most compact 4-body cluster 
is necessary before including any other more extended 
4-body or higher-order clusters. Within our set of 45 
arbitrary structures only one (147.8 me V/ atom in Table 
Inj) has contribution from the most compact 4-body clus- 
ter, so any CE including this cluster has formally infinite 
(thus not minimal) CV, so an optimal CE should contain 
pairs and triplets only. Indeed, the optimal CE (denoted 
CE3), see Fig. 121 includes 3 pair and 3 triplet interac- 
tions and yields Tc of 1370 K, again near the observed 
value and well within the CV error of ±15.2 meV. Both 
CE2 and CE3 are examples of the localized CE hierar- 
chy allowed by Rules 1-3. Notably, we find no failure or 
ill-description of thermodynamics for optimal truncation 
embodied in the new rules, see Table n 

Moreover, we predicted from these optimal CE that 
NiaV has numerous metastable long-period superstruc- 
tures (LPS) of the type (03^1) with m > 1. The DO22 
ground state is m = 1, 15023 is m = 2, and LI2 is m — 00, 
see Table ^ We then confirmed by direct DFT calcu- 
lations that over 23 metastable structures are between 



DO22 and LI2. Clearly, structural energy differences then 
will be sensitive to thermal antisites or partial-order, i.e. 
chemical environments that distinguish DO22 from other 
low-energy structures. 

The relative energies of DO22 and -DO23, which can be 
viewed as DO22 with (001) APBs, give an estimate of 
the (001) APB energy per site of the antiphase plane: 
^APB ^ 4[eD023 _ ^;£'022], In Table HH our calcu- 
lated E^^^ = 101.6 meV and CE3-predicted E^^^ = 
101.1 meV agree at perfect long-range order. However, 
binaries with first-order transitions have order parame- 
ters 7] (defined in ^3) that jump from to 0.7-0.9 at Tc. 
For partial order below Tc as in experiment, we predict 
that E^§^{ri) are 81, 65, and 50meV for t^'s of 0.9, 0.8, 
and 0.7, respectively. From superdislocation separation 
measurements, assessed values are 52 ± 20 meV at 273 K 
and 55 ± 18meV at 900 K fiTi, with roughly constant 
rj < 1 due to lack of kinetics. 

The real-space Warren-Cowley SRO parameters ckjmn 
were calculated using our CE within Monte Carlo at 
r=1.04 Tc, as in experiment. The proper way to com- 
pare calculated SRO to experimental data is in 0]. Full 
details of the agreement between calculated and exper- 
imental aimn will be given elsewhere. However the en- 
ergetics associated with SRO given by AE^^^q^'^^^ w 
ksT (a-HlOO) - a~i(li0)) /16c(l - c) can be directly 
estimated from the calculated a(k) at {1^0} and {100} 
k-points, as done experimentally 17]. We obtain 17 ± 
15 meV for CE3 at 1392 K, now in agreement with exper- 
iment 1171 and coherent-potential approximation (CPA) 
results [13], see Tabled Our results confirm the CPA ex- 
planation for Ni3V SRO energetics and the discrepancy 
between T = OK DFT results and measurements as aris- 
ing from the str ong dependence of E^^^{ri) on the state 
of partial order |l9| . 

Finally, we discuss issues that led to previous poor 
Ni3V results. For the range of ECls included in our 
truncated CE basis sets, AE^]^^°°^' = [eI^]^ - E^e^] = 
2[E^l''^ -E^l'''']=E^P^ /2, as verified in Table|ll| So for 
a truncated CE, LI2 can be viewed also as a (001) APB 
in L»022. Other LPS, e.g., with Edft = 33.7meV/atom 
and E^p^ — 25.4meV/atom in Table IhI also have in- 
distinguishable energies within the truncated CE. This 
observation has great import in Ni3V. Table shows 
that AE^]^-^'^^^ and AE^%^'='^^ are not equal! This 
implies again that there is a strong configurational de- 
pendence of partially-ordered LI2 energy on 77, as indeed 
shown by CPA calculations Because LI2 is highly 

metastable with respect to DO22, a truncated CE will be 
suspect versus i] (particularly for 77 ~ 1) unless all clus- 
ters that distinguish LI2 from DOp, and similar LPS, are 
included in the basis. In Refs.EHliE AE^]^~'^'^^^ at 
K was forced to coincide with AE^%^'^^^ = 101 meV 
by overfitting and including certain 3- and 4-body clus- 
ters arbitrarily that created an invalid CE and hence 
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inaccurate energetics. Our truncated CE properly de- 
scribes the observed thermodynamics, but not high- 
energy and iU-described structures hke fully-ordered LI2 
that are unimportant for thermodynamics, as evidenced 
in Table ITU Of course, calculating more DFT structural 
energies and properly extending the CE basis to include 
critical n-body clusters would describe everything more 
reliably. 

Generally speaking, neglecting stronger interactions 
and assigning their weight to weaker and less physically 
important longer-range [or higher-order] ones, i.e. vio- 
lating Rule 1 [or Rule 2] , leads to inaccurate predictions 
of energetics. Overfitting (neglecting Rule 3) results in 
large errors in predicted energies, which were not used in 
the fit. Combined violations can result in dramatic fail- 
ures: for example, previous CE for Ni-V overfitted 
energies (violating Rule 3) and included, e.g., a longer- 
ranged 4-body before the most compact one (violating 
Rule 1) - hence the disagreement with experiment and 
CPA. Previous CE results are generally valid if only Rules 
1 and 2 were obeyed with no large overfitting; in such 
cases, minimizing the CV error leads only to a moderate 
improvement of accuracy. The optimal CE basis trun- 
cation presented has an error that is variational, which 
is not necessarily the case for other basis-set truncation 
and reduction methods. 

In summary, the cluster-expansion method is a valu- 
able first-principles tool for predicting and interpreting 
thermodynamic behavior in alloys. With convergence 
and reliability in mind, we presented a simple varia- 
tional method for the optimal truncation of the cluster- 
expansion basis set. We presented the method as a set 
of rules that are computationally easy to implement. For 
a given set of DFT structural energies and no a priori 
knowledge of which clusters are needed to represent a 
particular alloy well, our truncation method provides a 
unique optimal choice of clusters based on their contri- 
bution to thermodynamics and variational reduction in 
error, avoids choosing clusters by intuition, and gives re- 
liable thermodynamic predictions. We exemplified the 
importance of this new approach in fee NiaV by predict- 
ing important new metastable structures and by showing 
agreement with order-disorder temperature, anti-phase 
boundary energy, and short-range order energetics, all 
quantities missed by previous cluster-expansion appli- 
cations. We also elucidated the origin of the previous 
failures. Without a priori information, the new cluster- 
expansion strategy allows reliable thermodynamic pre- 
dictions in alloys. 
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